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WEAK MONADIC SECOND ORDER THEORY OF SUCCESOR 

is not elementary-recursive "*" 

By 
Albert R. Meyer 

Let L SIS be the set of formulas expressible in a weak monadic second 
order logic using only the predicates [x = y+1 ] and [x € X], Buchi and 
Elgot[3,4] have shown that the truth of sentences in L (under the standard 
interpretation < N, successor > with second order variables interpreted as 
ranging over finite sets) is decidable. We refer to the true sentences in 
L SIS as WSIS. We shall prove that WSIS is not elementary-recursive in the 
sense of Kalmar. In fact, we claim a stronger result: 

Theorem 1 : There is a constant € > such that if K is a Turing machine 
which, started with any sentence in L^ on its tape, eventually halts in 
a designated accepting state if and only if the sentence is true, then for 
all sufficiently large n, there is a sentence of length n + * for which W s 
computation requires 

2 n 



2'' ( L6'log 2 nJ 



steps and tape squares. 



f. ■ — 

- By the length of a sentence we mean the number of characters in it 
including parentheses, digits in subscripts, etc. Any of the standard 
conventions for punctuating well-formed formulas may be used, except that in 
some cases conventions for matching parentheses may imply that for infinitely 
many n, there cannot be any wff's of length n. In this case, we assume that 
wff s may be lengthened by concatenating a finite sequence of "blank" 
symbols which leave the meaning of the wff unchanged, so that sentences of 
length n can be constructed for all sufficiently large n. 

^rant'cj^Sr^r'T'^ by . th \ N f ional Sc -nce Foundation under research 
grant GJ-34671. Reproduction in whole or in part is permitted for any 
purpose of the United State Government. W 
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t, (n) 
Let t (n) = n, t (n) = 2 k .A well-known characterization of 

ri4i 

the elementary-recursive functions by R.W. Ritchie shows that a set of 
sentences is elementary-recursive iff it is recognizable in space bounded by 



t k (n) = 



2 2 " 




for some fixed k and all inputs of length n £ 0. Hence, WSIS is not 
elementary-recursive. 

In these notes we prove a somewhat less powerful version of Theorem 1, 
which by Ritchie's result is still sufficient to establish the truth of 
our title. 

Theorem 2 : Let 93! be a Turing machine which, started with any sentence in 

L on its tape, eventually halts in a designated accepting state iff the 
sentence is true. Then for any k ^ 0, there are infinitely many n for 
which 931' s computation requires 

■>n 



2 2 



r 
r' i k 



steps and tape squares for some sentence of length n. 



The idea behind our proof will be to show that there are sentences in 
L of length n which describe the computation of Turing machines, provided 

the space required by the computation is not greater than t (n). Since 

a Turing machine using a given amount of space can simulate and differ from 



all machines using less space, we will deduce that small sentences in L „ 

can describe inherently long computations, and hence L must itself be 
difficult to decide. 

Actually it will be more convenient to develop an intermediate notation 
called y-expressions for sets of finite sequences. We will show that 
y-expressions can, in an appropriate sense, describe Turing machine 
computations, and that L can describe properties of y-expressions. 

O ID 

Definition : Let E be a finite set whose elements are called symbols. 

£ is the set of all finite sequences of symbols from T. For x, y € T, , 
the concatenation of x and y, written x«y or xy, is the sequence 
consisting of the symbols of x followed by those of y. An element x £ £ is 
called a word, and the length of x is written 4(x). We use \ to designate 
the vacuous sequence of length zero in T- which by convention has the 
property that x'A. = \«x = x for any x £ £ . (E is the free monoid with 
identity X generated by £). ) Concatenation is extended to subsets 
A, B c E by the rule 

A«B = AB = {xy | x € A, y € B] . 
For any A c 2 , we define 

, , , , n+1 . n . * i 00 , . n 
A = [\] , A = A -A, A = n U A . 

These operations are familiar in automata theory. We introduce one 
further mapping. 

Definition : For any £, the function y„: P(S ) -> P(£ ) is defined by the 
rules 

P(S) = {A I ACS] = the power set of S. 
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Y E ({x)) = [y 6 T* | A(x) = 4(x)} = E* 00 for x € £*, 

Y V (A) = U Y({*}) for A c E*. 

27 x€A 

We omit the subscript on y_ when E is clear from context. 

y-expressions over E are certain words in (E U (y, ', -i, U, (, )} ) where 

y, • , -i, U, (, ) are symbols not in E. Any y-expression a defines a 

set L(o0 c: E . 

Definition : For any E, y- expressions over E and the function 

L:{ y-expressions over E} -* P(E ) are defined inductively as follows: 

1) a is a -y-expression over E for any <r 6 E, and L(cr) = {ct} ; 

2) if cv, p are y-expressions over E, then ( a • P 2' i, ^ ~ ^ ~' 
—1 ( a ) , and y ( a ), are y-expressions over E, and 
L(U- P)) = L(o)'L(p), LU a U P)) = L ( ff ) U L ^>' 

L( -1 ( a )) = E X -L(a), and L(y ( a )) = "y(L(ar)) t 

3) That's all. 

Having thus made clear the distinction between a y-expression 
a and the set L(a) it defines, we will frequently ignore the distinction 
when there can be no confusion. Thus we write E = <J U -i(ct) instead of 
L" = L( ( cr U -1 ( a ) ) ). Similarly, for any set of letters V c: E, 



v = -,(£* • (E-v).E f ) 

* * 

since V consists precisely of those words in S which do not contain a 

symbol not in V. Thus there is a y-expression a over E such that L(o0 = 



V . DeMorgan's law gives us intersection, and then the identities 

V n = E n n V*, and 
S n = Y(V n ) 

imply that from a -^-expression of length s for £ n we can obtain a 
Y-expression of length s + c for V n , and conversely from V n to if 1 , for 
some constant c and all s, n € N. We shall show below that in general 
s may be much smaller than n. 

Definition : Empty (£) = [a | a is a y-expression over D and L(a) = 0} . 

Since the regular (finite automaton recognizable) subsets of S" are 
closed under •, U, -i, and y, it follows that Empty (S) is recursive and 
in fact primitive recursive. One simply constructs a finite automaton 
for L(qO and tests whether the automaton accepts some word; there are 
well-known procedures to do this. A priori analysis of this procedure 
however indicates that from deterministic automata for y-expressions a, 
(3 one would obtain a non - deterministic automaton for a* (3 or y(aO, and then 
would have to apply the "subset construction" of Rabin-Scott [Bj to 
obtain an automaton for -i(q" (3) or -iy(a). Since the subset construction 
can exponentially increase the number of states in the automaton, 
Y-expressions in which k complementations alternated with y' s a nd 
concatenations can lead to an automaton with t (2) states. The time and 
space required by a Turing machine which recognizes Empty (£) by the 
procedure outlined above can be bounded above by 




for some constant c and all ^-expressions of length n £ 0. It will follow 
from results below that such absurd inefficiency is inevitable. 

Definition : A Turing machine 301 recognizes a set A c= J? if, when started 

* 
with any word x € S on its tape, HR halts in a designated accepting 

state iff x 6 A. 

Let f : N -» N . The space complexity of a set A c E" is at most 

_f almost everywhere , written 

SPACE (A) =S f (a.e.) 

iff there is a Turing machine which recognizes A and which, for all but 
finitely many x € A, uses at most f(i(x)) tape squares in its computation 
on input x. The space complexity of A exceeds f infinitely often written 

SPACE (A) > f (i.o.) 

iff it is not true that SPACE (A) <; f (a.e.)- 

We shall use Turing's original one tape, one read-write head model 
of Turing machine, and define the number of tape squares used during the 
computation on input x to be the larger of i.(x) and the number of tape 
squares visited by the read-write head. Then by convention at least 
max( i.(x), 1} tape squares are used in a computation on any input word x. 

We briefly review some well-known facts, first established by Stearns, 
Hartmanis, and Lewis [15], about space-bounded Turing machine computations. 



-7- 

Definition: A function f : N -> N is tape constructible iff there is a 
Turing machine which, started with any input word of length n £ 0, halts 
having used exactly f(n) tape squares. 

Fact_l: t Q + 1 = \n[n+l] is tape constructible. For any k > t is 

' k 
tape constructible. 

Fact 2 : If f: N "* N is tape constructible, and SPACE (A) <; f (a.e.) for 
some ACE, then there is a Turing machine which recognizes A which halts 
on eyery_ input x € L using at most f(£(x)) tape squares. 
Hence, SPACE (A) s; f « SPACE (if-A) <: f. 

Fact_3: If f: N -+ N is tape constructible, then there is an A <z {0,1}* 
such that 

SPACE (A) <; f and 

SPACE (A) > g (i.o.). 

for any g : N -+ N such that 

lim °; ( = 0. 
f(n) 

n->°= 
Our proof consists of a sequence of reductions of one decision or 
recognition problem to another. In contrast to the usual reductions of 
recursive function theory, our reductions must be computationally efficient. 
We introduce a particular notion of efficient reduction which is sufficient 
for our purposes. 

Definition: Let E £ be finite sets of symbols, and A cV* A c E* 

i ^ 1 1 ' 2 2 ' 

A 1 is efficiently reducible to A written 



A x £ff A 2 



providing there is a polynomial p and a Turing machine which, started with 
any word x £ E } on its tape, eventually halts with a word y G l" on its 
tape such that 

1) x € A 1 » y G A and 

2) the number of tape squares used in the computation on input 
x is at most pU(x)) (and a fortiori i(y) <, p(j£(x))). 

We remark that all the reductions which are described below require 
only a linear polynomial number of tape squares and a polynomial number of 
steps, but to minimize the demands on the readers intuition (since we never 
actually give a flow-chart or table of quadruples for the Turing machines 
we describe) we allow polynomials of any degree. Even so, eff is much more 
restricted than is necessary to prove Theorem 2. 

Fact Zf - eff is a transitive relation on sets of words. 

Fac^_5. If A x eff A 2 and SPACE (A,,) s f (a.c), then there is a polynomial 

p such that 

SPACE (A^ <. \n[ max{f(m) | m <; p(n)} + p(n) ] (a.e.) 
Fact 6 " If A ! eff A 2 a nd SPACE (A^ > t fc+1 (i.o.), then SPACE (A 2 ) > t (i.o.) 

Proof. Immediate from Fact 5 and that observation that for any 
polynomial p, t k (p(n)) + p(n) ^ fc k+1 (n) for a H sufficiently large n. 

The proof of Theorem 2 can now be summarized. 

Proof of Theorem 2 : We will establish below that 

Empty ((0,1}) eff WSIS 
Emp_ty (I) eff Empty ({0,1}) for any finite I, 
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and finally that for any k and for any set A c (0,1} such that 
SPACE (A) ^ t, (a.e.) there is a finite S such that 

A eff Empty (?) 

From fact 4, we have A eff WSIS for any A and k such that 
SPACE (A) £ t (a.e.)- 

Then from facts 1, 3 and 6 we conclude that SPACE ( WSIS ) > t (i.o.) 
for any k. Q.E.D. 

It remains only to establish the required reductions. 

Lemma 1 : Emptv CfO,!} ) eff WSIS 

Proof : For any y-expression a over {0,1} we shall show how to construct 

a formula F € L„ T „ with two free integer variables and one free set 
a SIS 

variable. For any set M <= N, let C : N -> (0,1} be the characteristic 

function of M that is, C„,(n) = 1 » n € M. The formula F will be 
» ' M a 

constructed so that for n, m € N, M c N, M finite: 

F (n,m,M) is true » [ [n < m and C (n) •C vr (n+1) ... C_,(m-1) € L(a) ] 
a MM M 

or [n = m and \ € L(a)]]. 

F is constructed by induction on the definition of y-expressions. If a 

a 

is or 1, then 

F Q (x,y,X) is [y = x+1 and -, (x £ X) ] , 
F (x,y,X) is [y = x+1 and x € X]. 

If a is ( p • 6 ), then 

r**s r>*j f**s 

F (x,y,X) is (3z) [x <■ z and z £ y and F (x,z,X) and F.(z,y,X)]. 
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If a is y ( 3 ), then 



F a (x,y,X) is (3X )[F (x,y f X )]. 



If a is ( P JJ ^ or n ^ p 2' then F is f F R — F 6 ^ or ^ x ^ y — "^ F p( x >y» x )]» 

respectively. 

It is clear that there is a Turing machine which, given an input 
ot € [0, 1, ( , ), U, 7, • , -i ] , can test whether a is a well-formed 
-y-expression and, if so, print out the sentence 

-i (3x)(3y)(3X)[F a (x,y,X)], 

never using more space than some fixed polynomial in Z(ot) . (If a is not 
well-formed, the machine prints out (3x) [x = x+1].) Hence, Empty ((0,1} ) 
effWSIS- Q.E.D. 

It will be convenient to work with larger symbol sets than (0,1), but 
a trivial coding will demonstrate that this involves no loss of generality. 

Let £ be any finite set of symbols with |S| ^ 2. Say 0, 1 € £, 

0^1. Then for any n^ 1, there is a y-expression over E for (27) . 

* n-1 * 
To see this, consider a word in T, not in (0 1) . Such a word either 

fails to begin with 1, fails to end with 1, or contains a subword in 
Zf 1 " 1 (Z>0) or 1 Z7 n " 1 (Z>-l). Hence 

\ U- 1 ((0 n ' 1 l) ,V ) = -n(0 n-1 l £*) U -.(E* 1) U(f V if^S-O) £*) 

U(E* 1 2f _1 (E-l) if), 

and so, noting that \ = -i(2>£ ), we have 

(o 11 " 1 1)* = -i(\ U -n((0 n-1 1)*)) U K 

and 

n * n-1 * 
(E n ) = 7((0 1) ). 
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Now given any finite set £ choose n sufficiently large that 
'£ I s 1^1 and let h: S "♦ 27 be any one-one function. Extend h to a one-one 
map from P(£ ) into P((£ n )*) by the obvious rules h(X) = X, 

h(xa 1 ) = h(x)-h(CT 1 ) for x £ E^, c^ € Z^, and h(A) = (J {h(x)}for A c £ * 

x€A 

There is then a y-expression over £ for MZ^"), because a word fails to be 

■>'< 
in h(^ ) either because its length is not a multiple of n, or else because 

it contains a subword of length n not in h(£ ) which begins at a position 
congruent to one modulo n: 

£*-h (E 1 *) = n( (£ n ) *) U (£ n ) * • (£ n -h (£ x ) ) • (£ n ) * . 

Lemma 2 : (Coding) Let £ £ be finite sets of symbols with |£| £ 2. 

Let h: P(£ ) -> P((£ )*) be the extension of a one-one function from 
£^ to £ for some n £ 1. There is a Turing machine which, started with 
a y-expression a over £ , halts with a y-expression p over £ on its tape 
such that 

h(L(a)) = L(P). 
Moreover the space used during the computation with input a is bounded 
by a polynomial in &(ot) . 

Proof . The transformation of a to p operates by applying the following 
rules recursively. 

If a € £., P is set equal to an expression for h(L(cv)). 

If a is ( q- • a ) or ( a. U cO , then p is ( B, • p„) or 

i. P]_ ,U P 2 ^ ' res P ectivel y» where p , P 2 are the transforms of a , ct . 
If a is ^ ^ a ^ , then p is 



Finally, if a is -i ( a. ) , then p is -i ((B U B_ ) ) , since 
1~ ~ 1 ~ 21 
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n((n(T(P ))UL )) 

where p is the transform of a and L is a -y-expression over E for 

s'-h^'V (Note that h(y (A)) = Y s (h(A)) ME^) for A c J^*, whicl 

justifies this rule.) 

-i ( rv ) . thfin R is -, CCR lip 

h(E : ,V -A) = h^*) -h(A) = £* -(h(A) U (Z-hQ^*)) for A c^*. 

It is clear that a Turing machine can carry out this recursive 
transformation within the required space bound. Q.E.D. 

Corollary : Empty (E) eff Empty {0,1} for any finite E. 

Proof : Code E into (0,1} via h as in Lemma 2. Then a € Empty (E) «* 

L(o0 = « h(L(aO) = *» L(p) = » p € Empty {0,1}. Q.E.D. 

We now show how, given a -y-expressiori for E , one can construct 

a -y-expression of about the same size describing any desired computation 

of a Turing machine, providing the states and symbols of the Turing 

machine can be represented in E and the computation only requires n tape 

squares. This construction will be applied recursively to obtain 

t k (n) 
-y-expressions of size n for £ , and will then finally be used to 

conclude that A eff Empty (E) for any A c {0,1}' such that SPACE (A) <: 
t k (a.e.). 
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Definition : Let 331 be any Turing machine with tape symbols T and states 

S. Assume b (E T where "b" designates a blank tape square. An 

+ 
instantaneous description (i.d.) of 331 is a word in (T U (S x T)) which 

contains exactly one symbol in S X T. Given any i.d. x = y«(s,t)»z 

for y, z€T,s€S, t€T, the next i.d., Next^(x) is defined as follows: 

if when 331 is in state s with its read-write head scanning symbol t, 331 enters 

state s' and writes symbol t' € T, then Next (x) is 

y*(s', t')«z if 331 does not shift its head, 
yt'fs', u)«w if SCR shifts its head right and z = uw 

it 

for u € T, w € T , 
w«(s', u) # t'«z if 331 shifts its head left and y = wu 

for u € T, w € T . 

Next_(x) is undefined if (s,t) is a halting condition, or if (s,t) is 
the rightmost (leftmost) symbol of x and 331 shifts right (left). Let 
Next c ro (x,0) = x if x is an i.d., undefined otherwise; NextL _(x, n+1) = 
Next^(Next^(x,n)). 

Finally, let # be a symbol not in T U (S X T). The computation 
Comp(331,x) of 331 from x is singleton set consisting of the following word 
in ({#} U T U(S X T))*: 

Comp(331,x) = {#'Next^(x,0)-#'Next^(x,l)«# #«Next^(x,n)'#} 



f S X T = { (s,t) | s € s and t € T) . We assume T fl (S X T) = 
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where n is the least integer such that (q , t) occurs in Next<™(x,n) for 
some t € T and designated halting state q . Comp(TO,x) = if there is no 

3. 

such n. 

Remark : Note that our definition of computation differs from the one 
commonly in the literature. The computation Comp(TO,x) is defined for 
i - d - ' s x » not input words x. Moreover, all i.d.'s in Comp(TO,x) have 
exactly the same length. A key property of Comp(TO,x) is given next. 

Fact_7: Given 3Jt as in the preceding definition, let £ = {#} U T U (S X T). 

Then for any i.d. y € L'\ the n-l St , n th and n+l St symbols of y 
uniquely determine the n symbol of Next^~(y) for 1 < n < &(y) providing 
Next^.(y) is defined. 

Hence, there is a partial function ^: S 3 -t J such that if cr ,ct ,ct 

. , ,st th ■ st 
are the n-1 , n , n+1 symbols of Comp(TO,x), then ^j^-,' ^' ^ iS the 

st 
n+X(x)+l symbol of Comp(TO,x) for 1 < n < 4(Comp(TO,x) )-4(x) and any i.d. x 

such that Comp(TO,x) ^ 0. Also, ^(^.ct^o^) = 0, if ^ 2 € (S X T) and 
ct 2 is a halting condition of TO. 

Lemma 3 : (Simulation) Let TO be a Turing machine with states S, symbols T, 

and designated halting state q £ S. Let £ = (#} U T U(S X T) . There is 
a Turing machine 3? (TO) which, started with any word y#«o/ on its tape 
where y is an i.d. of TO and a is a y-expression over £ such that L(qO = 
£ for some n > 0, halts with a y-expression p over £ such that 



L(p) = Comp(TO,b n ^yb n ). 



-15- 

Moreover, there is a polynomial p such that ?(5]?) never uses more than 
p(4(y#«o0) tape squares in its computation. 

Proo_f: We shall describe how to construct the y-expression p for 
Comp(9Jl,b n yb n ) from y#»a where L(a) = I?\ We begin by noting that the 
words in Zf not equal to Comp(iTU,b n yb n ) , i.e., -n(Comp(5J},b n y b n ) ) , can be 
characterized as follows: 

1) words that do not begin with #b n y b n #, or 

2) words that do not contain q , or 

a 

3) words that do not end with #, or 

4) words that violate the functional condition determined by 
f~. in Fact 7. 

These four sets of words can also be described by the formulas 

i') -i(#»(L(a) n b").y.(L( ff ) n b'V#'S""), 

2') -n(Zf.({qJXT).£*), 

a 

3') -,(E*.#), 



4') U 

O" 



1 ,cr 2 ,a 3 eD [E "- CT i ff 2 CJ 3 ,L(Qr), ^ (y) " 1,L ^-( S -¥ CT l' cr 2' ff 3 )),Z: *: 



But it is easy to see how to construct y-expressions directly from 
(l')-(4 T ), and therefore p is simply the complement of the union of these 
four expressions. Note that £(p) s c - j& (y#cv) for some constant c which 
depends only on DJi, and not on y or a. Moreover a Turing machine 
y(OT) which constructs p from y#a need never use more tape squares than 
&(P), and so certainly runs within a polynomial space bound. Q.E.D. 
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Definition : A £-t -TM is a Turing machine such that for some polynomial 
p, some function f fc S: t , and all n > 0, when the Turing machine is 



,n 



started with on its tape, it halts with a word a on its tape such that 

t(n) 

1) Of is a -y-expression over £ and L(cy) = £ , 

2) the number of tape squares used in the computation is at most p(n), 



Lemma 4 : If there is a £'-t -TM for any finite £' , then there is a 

Z>t -TM for any £ such that |s| 2: 2. 

Proof: Code £' into £ as in Lemma 2. Details are left to the reader. 

Q.E.D. 

Lemma 5 : For any k £ and any £ with |£| £ 2, there is a £-t -TM. 

JcC 

Proof: A £-t -TM simply prints an expression for y(a n ) from input n , 

where cr € £ is any symbol. Proceeding by induction, assume there is a 
£-t -TM. Let 331 be a Turing machine which, started with n on its tape 
for any n > 0, lays out t, (n) tape squares on its tape and then uses 
these tape squares to cycle through some number f, (n) S 2 = t (n) 

steps before finally halting. Since t is tape-constructible, it is easy 
to obtain 931 as described. Choose £ as in the simulation lemma applied 
to TO . 

The £-t, -TM operates as follows: Given , use the £-t, -TM to 

f k (n) 
obtain a such that L(ce) = £ . Apply? (331) of the simulation lemma 

to (q Q ,0)0 •#•£* where q is the start state of 331 . This yields a 
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f k (n) n _ x f , 
y-expression |3 such that L(3) = Comp(93t,x) where x = b «(q ,0)0 k 

But CompCDl ,x) is defined since 931 halts on input within t, (r.) £ f v ^ n ) 

tape squares. Moreover, j6(Comp(93l ,x)) S: t (n) since 971 runs for at 1-ast 
t, (n) steps. Hence, the output of the £-t -TM is simply y(|3). 

Since by hypothesis a is obtainable in space p (n) for some polynomial p , 
and similarly p is obtainable in space p„(n+l + p..(n)) for some polynomial 
p , the entire process requires only polynomial space. Q.E.D. 

Lemma 6 : For any set A c {0,1) , if Comp(A) s t, (a.e.) for some k £ 0, 
then there is a finite S such that A eff Empty (£) . 

Proof : Let 9JI be a Turing machine which recognizes {0,1} -A and for 

every x € {0,1} , 931 halts using at most t (^(x)) tape squares. By 
Fact 2, there is such an 93?. 

Choose S as in the simulation lemma applied to 93). 



The Turing machine which efficiently reduces A to Empty (£) operates 

k" 



* 
as follows: given x € {0,1] , use a £-t, -TM to obtain a y-expression 



f k (n) 
a such that L(aO = L for n = 4(x). Apply 3?(3Jt) of the simulation 

lemma to (q , u)'w#«a where q is the start state of 9Jt, and 

x = uw for u€{0,l},w€{0,l} . (We ignore the case x = \.) This 

yields a y-expression which we claim is the desired output. 

Since 931 requires space at most t, (n) , we conclude that Comp(93t,y) where 

f (n) f (n) 

y = b «(q ,u)"w«b is nonempty iff x is accepted by 931 Hence 

x 6 A »xis n ot accepted by 931 « Comp(93i,y) = » L(0) = » p 6 Empty CQ. 

This verifies our claim that p is a correct output. 
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As in the preceding lemma, the Turing machine transforming x to (3 
requires space at most a polynomial in 4(x). Q.E.D. 

This completes the lemmas required for Theorem 2. 

It is not hard to extend this argument to obtain Theorem 1. We 
use a stronger form of Fact 3 due to Blum [ 1 ] to obtain from the proof 
of Theorem 2 more information about the frequency of the (i.o.) condition 
in the statement that Comp (WSIS ) > t (i.o.). 

K. 

Theorem 3 : The following decidable full and weak second order theories 
are not elementary-recursive: two successors, countable linear order, 
countable well-order, unary function with countable domain, unit interval 
under £. Also, first order theory of two successors with length and 
prefix predicates, and the first order theory of <N,+,E>, where P(x,y) 
= [x is a power of two and x divides y], are decidable but not elementary! 

These results follow by reasonably straightforward efficient reductions 
of WSIS to each of these theories. 

Y-expressions are themselves of interest as a decidable but non- 
elementary word problem. 

Corollary: Empty (f 0,1} ) is not elementary-recursive. 

Further remarks : 

(1) The results and methods described here were developed in May, 1972. [9] 
This paper is a revised version of a preliminary report with the same 
title written at that time. Since then, in collaboration with 



t ~~~ 

The decidability of these theories is shown in [6,12]. 
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M.J. Fischer, M.O. Rabin, and L. Stockmeyer, J. Ferrante and C. Rackoff, 
close upper and lower bounds on space or time have been obtained for most 
of the classical decidable theories in logic as well as for various 
notations related to ^-expressions. 

Some of the more interesting results to appear in forthcoming papers 
are 

(i) (Meyer) The satisfiability problem for sentences in the first 

order theory of linear order is not elementary; in fact space 

t (n) is required for some € > 0. WSIS also requires this 
t • n n 

much space. An upper bound t (n) follows from Rabin's 

c» n 

proof that S 2 S is decidable [12]! 
(ii) (Stockmeyer) The emptiness problem for expressions involving 
only the operation of U, • , -i is not elementary, that is, the 
y-operation is unnecessary. The simulation lemma and its proof 
become considerably more subtle, 
(iii) (Fischer-Rabin) Any decision procedure for the first-order 
theory of <N,+>, that is, Presburger's arithmetic, requires 
t (€*n) steps even on nondeterministic Turing machines. 
Ferrante and Rackoff [7], following Cooper [5] and Oppen[ll], 
have established an upper bound of space t (6»n). 
(iv) (Fischer-Rabin) Any decision procedure for the first order 

theory of <N-{0), •> requires time t (£«n) even on nondeterministic 
Turing machines. Rackoff has shown that space t (en) is 
sufficient. 

- _____ 

In [12], Rabin inaccurately claims his decision procedure is elementary. 



In a personal communication, he has informed me that he was aware that 
his procedure required sj 

definition of elementary. 



his procedure required space t (n) , but that he misunderstood the 

c« n 
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(v) (Fischer) Let S be any class of structures with a binary 

associative operator * and the property that for arbitrarily 
large n there exists s 6 S £ § such that 

s n * s m for 1 £ m < n, 

where s = s « s ■><•... -,? s. Then any decision procedure for 

m 
satisfiability over § of sentences in the first order language 

of * requires t (G«n) steps. This general result applies to 

nearly all the familiar decidable theories in logic, except for 

the propositional calculus and pure equality. 

(vi) (Meyer) The decision problem for satisfiability of sentences in 

monadic predicate calculus with only seven (approximately) 

quantifiers requires time t (€«n) even on nondeterministic 

Turing machines; time t.(c»n) is achievable on nondeterministic 

Turing machines. 

(vii) (Fischer-Meyer) The decision problem for satisfiability of 

sentences in the first order language of a single monadic 

function is not elementary. 

(2) Abstract complexity theory has been open to the criticism of being 

unable to exhibit "natural" decision problems in which phenomena 

such as speed-up appeared. Applying Blum's results [2] on effective 

speed-up to our simulation of Turing machines via WSIS . we can show 

that given any decision procedure for WSIS , one can effectively 

construct a new decision procedure for WSIS which is much faster 

(faster by t for any k) than the given procedure on at least one 
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sentence of length n for all sufficiently large integers n. Similar results 
apply to the other decision procedures mentioned above. 

(3) The relation eff can be characterized in a manner similar to 

the definition of the elementary functions or the primitive recursive 

2.5 
functions. e , so called because it lies properly between the Grzegorczyk 

2 3 
classes e and e , is defined inductively as follows: 

1. x ^ y, x+y, x-y, x 110 ^ € e*' 5 , 

2.5 

2. e is closed under explicit transformation (substituting 

constants and renaming or identifying variables), 

o 2.5 . , J , 

-5- e is closed under composition of functions, and 

2,5 

4. e is closed under limited recursion, limited sum and limited 

..... t 
minimization. 

5. That's all. 

it 

If we identify words in £ with the integers they represent in |s|-adic 
notation, and for any set A c jf let C.: N -4 {0,1} be the characteristic 
function of the set of integers identified with A, then B eff A if and only 

if C A (x) = C (f(x)) for some f 6 e 2 ' 5 and all x € N. 

2. 5 
Essentially e ' provides a high-level programming language in which 

one can formally express the procedures we informally claimed could be 

carried out by polynomial space-bounded Turing machines. In this manner 

our proof could be presented in a completely formal fashion without appeal 

to intuition about the space requirements of computations. We prefer the 

latter approach. 



t 
See Grzegorczyk' s paper for definitions. [8] . Closure under limited 

recursion actually implies closure under limited sum and limited 

minimization. 



.S'^^^SMpi^ 
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